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Abstract. The four moment theorem asserts, roughly speaking, that the joint 
distribution of a small number of eigenvalues of a Wigner random matrix (when 
measured at the scale of the mean eigenvalue spacing) depends only on the 
first four moments of the entries of the matrix. In this paper, we extend the 
four moment theorem to also cover the coefficients of the eigenvectors of a 
Wigner random matrix. A similar result (with different hypotheses) has been 
proved recently by Knowles and Yin, using a different method. 

As an application, we prove some central limit theorems for these eigenvec- 
tors. In another application, we prove a universality result for the resolvent, 
up to the real axis. This implies universality of the inverse matrix. 



1. Introduction 

Consider a random Hcrmitian n X n matrix M n with n real eigenvalues (counting 
multiplicity) 

Ai(M„) < ... < A„(M„). 
By the spectral theorem, one can find an orthonormal basis 

Ul (M n ),...,u n (M n ) G S^'cC" 

of unit eigenvectors of this matrix: 

M nUi {M n ) = Xi(M n )ui{M n ), 

where S* 2 ™ -1 := {z G C™ : \z\ — 1} is the unit sphere of C™, and we view elements 
of C" as column vectors. We write Ui :P (M n ) — (ui(M n ), e p ) for the p th coefficient of 
Ui(M n ) for each 1 < i,p < n. If M n is not Hermitian, but is in fact real symmetric, 
then we can require the zii(M n ) to have real coefficients, thus taking values in the 
unit sphere S 1 "" 1 := {x G R" : |x| = f } of R". 

Unfortunately, the eigenvectors Ui{M n ) are not unique in either the Hermitian or 
real symmetric cases; even if one assumes that the spectrum of M n is simple, in the 
sense that 

Ai(M„) < ... < X n (M n ), 
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one has the freedom to rotate each m(M n ) by a unit 1 phase e^ 10 € U(l). In the 
real symmetric case, in which we force the eigenvectors to have real coefficients, 
one only has the freedom to multiply each Ui{M n ) by a sign ± <E 0(1). However, 
one can eliminate this U(l) phase ambiguity or O(l) sign ambiguity (in the case of 
simple spectrum) by a adopting a variety of viewpoints: 

(i) One can consider the rank one projection operators 



instead of the eigenvectors Ui(M n ), thus the pq coefficient Pi, p , q (M n ) of 
Pi{M n ) is given by the formula 



(ii) One can consider the equivalence class {ui{M n )\ := {e^~^ Ui{M n ) : 9 e 
R} € S 2 "" 1 /U'(l) (or {±Ui{M n )} e S' n - 1 /0(1), in the real symmetric 
case) of all eigenvectors with eigenvalue Ai, instead of considering any one 
individual eigenvector. 

(hi) One can perform the ad hoc normalization of requiring u itP (M n ) to be 
positive real, where p is the first index for which Ui tP (M n ) 7^ (generically 
we will have p — 1 , and as we will see shortly, for Wigner matrices we will 
also have p = 1 with high probability). 

(iv) One can perform the random normalization of replacing Ui(M n ) with a 
randomly chosen rotation e v/ ~ Te «i(M n ) (in the Hermitian case) or ±Ui(M n ) 
(in the real symmetric case) (each the random phase or sign being chosen 
independently of each other, and (if M n is itself random) of M n ). 

Note that Pi, [ui(M n )], the ad hoc normalized Ui(M n ), and the randomly normal- 
ized Ui(M n ) in viewpoints (i)-(iv) respectively will be uniquely defined as long as the 
spectrum is simple (indeed, it suffices to have Aj_i(M„) < Aj(M„) < X i+ i(M n )). 
In the proofs of our main results, we shall adopt viewpoint (i) (which is natural 
from the perspective of spectral theory). However, in order to express our results 
in explicit coordinates, we will adopt the more ad hoc viewpoint (iii) or the random 
viewpoint (iv) in the statements of our results. 

Notations. We consider n as an asymptotic parameter tending to infinity. We use 
X < Y, Y > X, Y = Q(X), or X = 0(Y) to denote the bound X < CY for all 
sufficiently large n and for some constant C. Notations such as X <Cfe Y, X = Ok (Y) 
mean that the hidden constant C depend on another constant k. X = o(Y) or 
Y = uj(X) means that X/Y — > as n — > 00; the rate of decay here will be allowed 
to depend on other parameters. 

We will need some definitions that capture the intuition that a certain event E 
occurs very frequently. 

Definition 1 (Frequent events). Let E be an event depending on n. 



We use V — 1 to denote the imaginary unit, in order to free up the symbol i as an index 
variable. 



Pi(M n ) := Ul (M n )ui(M n )* 



Pi,P,q( M n) 



Ui,p(M n )u iiq (M n ). 
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• E holds asymptotically almost surely if P(E) = 1 — o(l). 

• E holds with high probability if P(E) > l — 0(n~ c ) for some constant c > 0. 

• E holds with overwhelming probability if P{E) > 1 — Oc(n~ c ) for every 
constant C > (or equivalently, that P(E) > 1 — exp(— w(logn))). 

• E holds almost surely if P(-B) = 1. 

The goal of this paper is to understand the distribution of the eigenvectors tij(M n ) 
(as normalized using viewpoint (iii) or (iv), for sake of concreteness) of a class of 
random matrix ensembles known as Wigner random matrices. 

Let us first identify the class of matrices we are working with. 

Definition 2 (Wigner matrices). Let n > 1 be an integer (which we view as a 
parameter going off to infinity) . An n x n Wigner Hermitian matrix M n is defined 
to be a random Hermitian n x n matrix M n = {£ij)i<i,j<ni in which the £y for 
1 < i < 3 < ^ a re jointly independent with ^ = £y. For 1 < i < j < n, we require 
that the £jj have mean zero and variance one, while for 1 < i = j < n we require 
that the &j (which are necessarily real) have mean zero and variance a 2 for some 
(T 2 > independent of i,j,n. (Note that we do not require the £y to be identically 
distributed, either on or off the diagonal.) 

We say that the Wigner matrix ensemble obeys condition CO if we have the expo- 
nential decay condition 

P(|^|>t C )<e- 4 

for all 1 < i,j < n and t > C , and some constants C,C (independent of i,j,n). 
We say that the Wigner matrix ensemble obeys condition CI with constant Co if 
one has 

E|^| Co <C 
for some constant C (independent of n). 

Of course, Condition CO implies Condition CI for any C , but not conversely. 

An important example of a Wigner matrix ensemble is the Gaussian unitary en- 
semble (CUE), which in our notation corresponds to the case when ^ has the 
complex normal distribution N(0, l)c with mean zero and variance one for i > j, 
and £ij has the real normal distribution N(0, 1)r with mean zero and variance one 
for i = j. This ensemble obeys Condition CO, and hence also Condition CI for 
any Co- Another important example is the Gaussian orthogonal ensemble (GOE), 
in which ^ has the real normal distribution iV(0, 1)r for i > j and the real normal 
distribution N(0, 2)r for i = j. 

The GOE and GUE are continuous ensembles, and it is therefore not difficult to 
show that the spectrum of such matrices are almost surely simple. The GUE en- 
semble is invariant under conjugation by unitary matrices, which implies that each 
unit eigenvector Ui{M n ) is uniformly distributed on the unit sphere S 2 ™ -1 of C n , 
after quotienting out by the U{1) action Ui(M n ) — > e^~^- Ui{M n ) (or equivalently, 
that [ui{M n )\ is uniformly distributed using the Haar measure on S' 2 ™ _1 /J7(l)). In 
particular, Pi{M n ) is uniformly distributed on the space of rank one projections 
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in C™. Similarly, for GOE, the unit eigenvector Ui(M n ) is uniformly distributed 
on the unit sphere of M™ (after quotienting out by the O(l) action m(M n ) — > 
±Ui(M n )). The same argument in fact shows that for GUE, the unitary matrix 
(u\(M n ), . . . , u n {M n )) is distributed according to Haar measure on the unitary 
group U(n) (after quotienting out by the left-action of the diagonal unitary matrices 
U(l) n ), and for GOE, the orthogonal matrix (ui(M n ), . . . ,u n {M n )) is distributed 
according to Haar measure on the orthogonal group 0(n) (after quotienting out by 
the left-action of the diagonal sign matrices 0(1)"). In particular, if one uses the 
random normalization (iv), then (u\(M n ), . . . , u n {M n )) will be distributed accord- 
ing to Haar measure on U(n) (in the GUE case) or 0(n) (in the GOE case). 

The distribution of coefficients of a matrix distributed using the Haar measure 
on the unitary or orthogonal groups has been studied by many authors [2], [16], 
[11], [23], [5], [4], [3], [12], [13]. It is known (sec [2]) that each coefficient, after 
multiplication by y/n, is asymptotically complex normal (in the unitary case) or real 
normal (in the orthogonal case). In fact the same is true for the joint distribution 
of multiple coefficients: 

Theorem 3. [12], [13] Let (u\, . . . , u n ) be distributed using Haar measure on 0(n). 
For 1 < i,p < n, let u ijP be the p th coefficient of Ui^ p . Let = N(0, 1)r for 
1 < hP < k be iid real gaussian random variables. 

(i) Ifk = o( v / n), then (\AiMi,p)i<i,p<fc and (£i :P )i<i, P <fc differ by o(l) in vari- 
ation norm. In other words, if F : M. m — > R is any measurable function 
with ||F|| L00 j Cfc 2^ < 1, then 

(1) |EF((Vnu ) I < o(l) 

where the error is uniform in F. 

(ii) If instead we make the weaker assumption that k = o(n/ logn), then it is 
possible to couple together Ui tP and such that sup 1<i p<fe \ \fnui^ p — 
converges to zero in probability. 

Similar conclusions hold for the unitary group U(n), replacing the real gaussian 
N(0, 1)k by the complex gaussian N(0, l)c- 

In [12] it is also shown that the hypotheses k = o(y / n), k = o(nj logn) in the 
above two results are best possible. Of course, by symmetry, one can replace the 
top left k x k minor (u.^ p )i<.^ p <k of the orthogonal matrix (u\, . . . , u n ) with any 
other k x k minor and obtain the same results. 

As a corollary of Theorem 3, we obtain an asymptotic for the distribution of 
eigenvector coefficients of GUE or GOE (normalizing using viewpoint (hi)). 

Corollary 4 (Eigenvector coefficients of GOE and GUE). Let M n be drawn from 
GOE, and let (u\(M n ), . . . , u n (M n )) be the eigenvectors (normalized using view- 
point (Hi); this is almost surely well-defined since the spectrum is almost surely 
simple). For 1 < i,p < n, let be independent random variables, with £ij = 
N(0, 1) R for p > 1 and = \N(0, 1) R | for p = 1. Let 1 < k < n, and let 
1 < i\ < . . . < ik < n and 1 < pi < . . . < pk < n be indices. 
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(i) Ifk = o( v / n), then {\fnu ta . Pb {M n ))i< a ^< k and {€i a , Pb )i< a ,b<k differ by o{\) 
in variation norm. 

(ii) If instead we make the weaker assumption that k — o(nj log n), then it is 
possible to couple together M n and^i^p such that sup 1<0 b<k \^/nui a ^ Pb {M n ) — 
Cia.Pbl converges to zero in probability. 

Similar conclusions hold for CUE, replacing the real gaussian N(0, 1)r by the 
complex gaussian N(0, l)c- If one uses the random normalization (iv) instead of 
(Hi), then the conclusions are the same, except that the absolute values are not 
present in the definition o/£j ; i. 

The main objective of this paper is to develop analogues of Corollary 4 for the 
more general Wigner ensembles from Definition 2. In particular, we would like to 
consider ensembles M n which are allowed to be discrete instead of continuous. 

One immediate difficulty that arises in the discrete setting is that one can now 
have a non-zero probability that the spectrum is non-simple. However, we have the 
following gap theorem from [17], [18], [19]: 

Theorem 5 (Gap theorem). Suppose that M n is a Wigner random matrix obeying 
Condition CI with a sufficiently large constant Co, and let cq > be independent 
of n. Write A n :— y/nM n for the rescaled matrix. Then for any 1 < i < n, one has 

P(\X l+1 (A n ) - Xi(A n )\ < n- C0 ) < n~ Cl 

for all sufficiently large n, where C\ > depends only on c . 

Proof. In the bulk case en < i < (1 — e)n assuming Condition CO, see [17, Theorem 
19]. For the extension to the edge case, see [18, Theorem 1.14]. For the relaxation 
of Condition CO to Condition CI (with a sufficiently large Co), see [19, Section 2]. 
See also [7] for some related level repulsion estimates (assuming some additional 
regularity hypotheses on M n ). □ 

Of course, one has Xi(A n ) = y/n\i(M n ). From the above theorem and the union 
bound, we see that there is an absolute constant c > such that if m = 0(n c ) 
and 1 < ii < . . . < i m < n, and M n obeys Condition CI with a sufficiently 
large Co, then with probability 1 — 0(n~ c ), the eigenvalues Xi 1 (M n ), . . . , Xi m (M n ) 
will all occur with multiplicity one, so that one can meaningfully normalise the 
eigenvectors u i± (M n ), . . . , u im (M n ) according to any of the viewpoints (i), (ii), (iii), 
(iv) mentioned previously, outside of an exceptional event of probability 0(n~ c ). 
On that exceptional event, we define the orthonormal eigenvector basis Ui(M n ) 
(and related objects such as the rank one projection Pi(M n )) in some arbitrary 
(measurable) fashion. 

1.1. Main results. We now turn to the distribution of the coefficients of the eigen- 
vectors. As the ensembles M n may be discrete, the eigenvector coefficients Uij(M n ) 
may be discrete also, and so one does not expect to have convergence in variation 
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distance any more. Instead, we will consider the weaker notion of vague conver- 
gence, which resembles the condition (1), but with F now required to be compactly 
supported, and either continuous or smooth. We need the following definition: 

Definition 6 (Matching moments). Let k,l > 1. Two Wigner random matrices 
M„ = (Cij)i<i,j<n and M' n — (£^-)i<ij< n are said to match to order k off the 
diagonal, and match to order I on the diagonal, if one has ERc(£ ij ) a im(£ i .,) fc = 
ERe(^ J )°Im(^- J ) b whenever a, b > and 1 < i < j < n are integers such that 
a + b < k (if i < j) or a + b < I (if i = j). 

We can now give our first main result, which partially extends Corollary 4 to other 
Wigner ensembles: 

Theorem 7 (Eigenvector coefficients of Wigner matrices). There are absolute con- 
stants 5, Co > such that the following statement holds. Let M n be drawn from 
a Wigner ensemble obeying Condition CI with constant Co that matches GOE to 
order 4 off the diagonal and to order 2 on the diagonal ( which, in particular, forces 
M n to be almost surely real symmetric) , and let (ui(M n ), . . . , u n (M n )) be the eigen- 
vectors (normalized using viewpoint (Hi)). For 1 < i,p < n, let be independent 
random variables, with £ ip = N(0, I)jj for p > 1 and p = \N(0, 1)r| for p = 1. 
Let 1 < k < n, and let 1 < i\ < . . . < < n and 1 < pi < . . . < p^ < n be indices. 

(i) (Vague convergence) Lf k — 0(n s ), then one has 

(2) EF((Vn«i„ ■,Pb )l<a,b<k )\=o(l) 

whenever F : C k — > M is a smooth function obeying the bounds 

\F(x)\=0(l) 

and 

\W 3 F(x)\ = 0(n s ) 

for all x G C fe2 and < j < 5. 

(ii) Lf instead we make the stronger assumption that k = 0(1), then it is possi- 
ble to couple together M n and such that sup 1<a b<fe \\fnu iatPb (M n ) — 
U a ,Pb\ converges to zero in probability. In particular, this implies that 
(y/nu la . Pb ) 1 < a ^ b < k converges to (£i a , Pb )i<a,b<k in distribution. 

The bounds are uniform in the choice of i\, . . . ,ife,pi, • ■ • ,Pk, and in the choice of 
F, provided that the implied constants in the bounds on F are kept fixed. Simi- 
lar conclusions hold for CUE, replacing the real gaussian iV(0, 1)r by the complex 
gaussian N(0, l)c. 

If one uses the random normalization (iv) instead of (Hi), then the conclusions are 
the same, except that the absolute values are not present in the definition of £i t ±. 

The bound k = 0(1) in (ii) can be extended by our method (with some effort) to 
k = o (log 1/2 n), but this still falls far short of the analogous range in Corollary 4. 
It is reasonable to expect that these bounds are not best possible (particularly if 
one places some additional regularity hypotheses on the coefficients of M n ). 
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We will deduce Theorem 7 from Corollary 4 by establishing a four moment theorem 
for eigenvectors, which is the main technical result of this paper: 

Theorem 8 (Four Moment Theorem for eigenvectors). There is an absolute con- 
stant Co > such that for every sufficiently small constant c > there is a 
constant d > such that the following holds. Let M n , M' n be two Wigner random 
matrices satisfying CI with constant Co that match to order 4 off the diagonal and 
to order 2 on the diagonal. Let k < n s , and let G : M. k x C fe — > M be a smooth 
function obeying the derivative bounds 

(3) \^G(x)\ < n C0 

for all < j < 5 and x G R fe x C k . Then for any 1 < i\, i 2 , ■ ■ ■ , ik < n and 
1 < Pi j • • • ,Pk, 1i, ■ ■ ■ > Qk < an d for n sufficiently large depending on e,5,Co, Co 
we have 

(4) |EG($(v^M„)) - EG(*(v^M;))| < n' C0 
where for any matrix M of size n, $(M) e K fe x C fe is i/ie topfe 

$(M) := ((A ia (M)) 1 < a < fe ,(nP iaiPo , ga (M)) 1 < a < fc ). 
The bounds are uniform in the choice ofi\,..., ik,Pi, ■ ■ ■ ,Pk, Qi, ■ ■ ■ , qk- 



The deduction of Theorem 7 from Corollary 4 and Theorem 8 is routine and is 
performed in Section 2. 

Theorem 8 is an extension of the four moment theorem for eigenvalues estab- 
lished in [17], [18], [19]. Indeed, the latter theorem is essentially the special case 
of Theorem 8 in which k — 0(1), and G only depends on the hrst k components 
(v / ^Ai o (M rl ))i< a <fe of <1>(M„). Unsurprisingly, the proof of Theorem 8 will rely 
heavily on the machinery developed in [17], [18], [19]. 

Theorem 8 was announced at the AIM workshop "Random matrices" in December 
2010. We have found out that recently a result in the same spirit has been proved 
by Knowles and Yin [14] using a somewhat different method. Knowles and Yin 
handled the k = 0(1) case with Condition CI replaced by the stronger Condition 
CO. Furthermore, they needeed control on k + 5 derivatives of G rather than just 5 
derivatives, and also a level repulsion hypothesis similar to (6), (7) below. On the 
other hand, their result holds for generalized Wigner matrices. 

The need for four matching moments in Theorem 8 is believed to be necessary; 
sec [20]. However, we conjecture that Corollary 4 continues to hold without the 
matching moment hypothesis. 

We can use Theorem 8 (together with other tools) to obtain a four moment theorem 
for the resolvent (or Green's function) coefficients 

/ 1 \-l n 1 

where A n is the rescaled matrix A n := sfnM n (so in particular Ui(A n ) — Ui(M n ) 
and Pi(A n ) = Pi{M n )). 
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More precisely, we have 

Theorem 9 (Four moment theorem for resolvents up to the real axis). Let M n , M' n 
be two Wigner random matrices satisfying CO that match to order 4 off the diagonal 
and to order 2 on the diagonal for some sufficiently large Co . Let z = E + in for 
some E € M and some rj > 0. We assume the level repulsion hypothesis that for 
any c> 0, one has with high probability that 

(6) inf \Xi(y/nM n ) -nz\> n~ c 

l<i<n 

and 

(7) inf |Ai(VnAf;) - nz\ > n~ c . 

l<i<n 

Let 1 < p,q < n. Then for any smooth function G : C — > C obeying the bounds 
W^G{x) = 0(1) for all x e C and < j < 5, one has 

EG (i^M n zl)-^j EG ((-^M; - zl)-^ = 0(n" c °) 

for some constant Co > independent of n. 

We isolate the z = case of this theorem as a corollary: 

Corollary 10 (Four moment theorem for the inverse matrix). Under the conditions 
of Theorem 9, we have 




We prove Theorem 9 in Section 5; it is established by combining Theorem 8 with 
an eigenvalue rigidity result from [10] and a local semicircle law from [9]. This result 
generalizes a similar four moment theorem from ([9] ( [9, Theorem 2.3]). Its main 
strengths as compared against that result are that r\ is allowed to go all the way to 
zero. In particular, one can take z to be zero, thus giving control of the coefficients 
of the inverse matrix M~ x . On the other hand, the result in [9, Theorem 2.3] does 
not require the hypothesis (6), (7), and allows the entries in M n (or M' n ) to have 
different variances, and the bounds are slightly sharper. 

Remark 11. The same proof allows one to control the joint distribution of k 
coefficients of several resolvents with k = 0(n s ) for some sufficiently small S > 0, 
assuming a level repulsion estimate at each energy z. 



The hypotheses (6), (7) are natural, as when these claims fail one would expect 
the resolvent to be unusually large. However, in practice such hypotheses are in 
fact automatic and can thus be omitted. For instance, we have 

Proposition 12 (Sufficient conditions for level repulsion). Let M n be a Wigner 
matrix whose entries £ij are identically distributed in the off-diagonal 1 < i < j < n, 
and also identically distributed on the diagonal 1 < i — j < n. Let z G C. 
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(i) If the real and imaginary parts of the off-diagonal coefficients of M n are 
iid and supported on at least three points, and M n obeys Condition Clfor 
a sufficiently large Co, then (6) holds. 

(ii) If z = and M n obeys Condition Clfor Co — 4, then (6) holds. 

Proof. For (i), see [21]; an earlier result assuming smoothness and decay on the 
coefficients (but without the requirement of iid real and imaginary parts) was es- 
tablished in [7] (see also [15] for a more refined result). The claim (ii) was recently 
established (by a rather different method) in [22] □ 

It is in fact likely that (6) and (7) in fact always hold whenever Condition Clis 
satisfied for a sufficiently large Co, but we do not attempt to establish this fact 
here. 

As another application, we can use our new results to obtain central limit theorems 
concerning eigenvectors. Here is a sample result. 

Theorem 13. Let M n be a random symmetric matrix obeying hypothesis CI for 
a sufficiently large constant Co, which matches the Gaussian Orthogonal Ensem- 
ble(GOE) to fourth order. Assume furthermore that the atom distributions of M n 
are symmetric (i.e. £ij = — £ij for all 1 < i,j < n), and identically distributed 
( or more precisely, that the £jj for i > j are identically distributed, and the £ij for 
i = j are also identically distributed) . Let i = i n be an index (or more precisely, a 
sequence of indices) between 1 and n, and let a = a n e S" 1-1 be a unit vector in E™ 
(or more precisely, a sequence of unit vectors). Assume either that 

(a) Ui(M n ) is normalized using the procedure (Hi), and a ■ e\ — o(l); or 

(b) Ui(M n ) is normalized using the procedure (iv). 

Then ^/n,Ui(M n ) ■ a tends to N(0, 1)r in distribution as n — » oo. 

Note that if one normalizes Ui(M n ) using procedure (iii), then Ui{M n ) ■ e\ = 
Ui,i(M n ) will be non-negative (and thus cannot converge in distribution to N(0, 1)r), 
which helps explain the presence of the condition a ■ e\ in condition (a). 

As an example to illustrate Theorem 13, we can take a = a n := ^=(1) •••,!) € 
5 n_1 , and i := [n/2\ . Then Theorem 13 asserts that the sum of the entries of the 
middle eigenvector uy n / 2 \(M n ) (using either normalization (iii) or normalization 
(iv)) is gaussian in the limit. 

We prove Theorem 13 in Section 4 as a consequence of Corollary 4 and a general 
central limit theorem on averages of approximately independent symmetric random 
variables (Proposition 25) which may be of independent interest. It should be 
possible to extend this result to more general ensembles M n (and to obtain the 
analogous results for ensembles that match CUE rather than GOE), and to obtain 
central limit theorems for the joint distribution of several statistics of the form 
^/nui{M n ) ■ a n , but we do not pursue this matter here. 
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Remark 14. The four moment theorem can be extended to handle the singular 
values of iid covariance matrices, instead of the eigenvalues of Wigner matrices; 
see [19]. It is possible to use that extension to establish an analogue of Theorem 
8 for the singular values and singular vectors of such matrices, and an analogue 
of Theorem 9 for the inverses of such matrices (which, as is well known, can be 
expressed in terms of the singular value decomposition of the matrix). We omit the 
details. 

2. Proof of Theorem 7 

We establish the claim in the GOE case only, as the GUE case is similar. We shall 
also establish the claim just for the normalization (iii), as the normalization (iv) 
can be treated similarly (or deduced directly from the results for (iii)). 

Let Co be as in Theorem 8, let 5 > be sufficiently small, and let M n , m(M n ), . . . , u n (M n ), 
k, ii, . . . , ik,Pi, ■ ■ ■ ,Pk, and F be as in that theorem. Let M' n be drawn 
from GOE, and write A n := y/nM n and A' n := \fnM' n . We initially assume that 
k = 0{n s ). By adding a dummy index and relabeling if necessary, we may assume 
without loss of generality that pi = 1. 

The normalization given by viewpoint (iii) is unstable when it^i is close to zero, 
and so we will need to eliminate this possibility first. To this end, let F : C fc — > R 
be a function of the form 

k 

F (x u ...,x k ) := H(l-x(n ws x a )) 

a=l 

where \ '■ C — > [0, 1] is a smooth cutoff which equals one on the ball B(0, 1) but 
vanishes outside of B(0, 2). Observe from the union bound that 

EF ((|6 a4 | 2 )i<a<fc) =1-0(1) 

and thus by Corollary 4, we have an analogous estimate for the quantities nPi i i (A' n ) = 
(VEu iail (A' n )) 2 : 

EJb((nP io , 1;1 (A / J) 1 < a < fe ) = l-o(l). 

Applying Theorem 8 (assuming that 5 is sufficiently small depending on Co, so that 
the losses of coming from bounding the derivatives of F can be absorbed into 
the n~ c ° factor) we conclude that 

(8) EF («,Ll(^n))l<aa) = 1 - o(l), 

and thus we have 

inf JnP w (A n ) | > n- 10S 

or equivalently that 

(9) inf V^u taA (A n )>n- 56 

l<a<k 

asymptotically almost surely. 
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Now we can prove part (i) of Theorem 7. From (8) one has 

E|l-JM(n^.a,i(AO)i<„< fc )| =o(l) 
and thus (by the boundedness of F) 

E (1 - F ((nP ioi i,i(Ai))i<o<fc)) F ((v / ^^ a ^(^n))i<a,fc<fe) = o(l). 

A similar argument gives 

E (1 - F ((\^i\ 2 )i<a<k)) F (Ki oJ ,Jl<„,6< ro ) = o(l). 
Thus, to prove (2), it suffices by the triangle inequality to show that 

EF ((v^«i. lP6 {A n ))l<a,b<k) - EF 

■,Pb )l__ a ;b<&;) — 

0(1) 

where F : C* 2 -> C is the function 

^((a;o,6)i<a,6<fc) = F ((|a; a ,i| 2 )i< a < fc ) F ((x a ,b)i<a,b<k) ■ 

From Corollary 4 we have 

EF ( K {\/ r nu ia ^ b (A l n ))i<a,b<k) - EF ((£ io 

,Pi )l<a,6<fc) 

0(1) 

so it suffices to show that 

EF ({^U latPb (A' n ))i< afi < k ) - F (( V / «W Ja ,p b (^n))l<a,fc<fe) = o(l). 

But observe that in the support of F, ^Jnui at \ is non-zero (and indeed, we have (9)), 
and the components \/nUi a> j b can then be recovered from the projection coefficients 
n Pi a ,Pb,i — nu i a ,Pb u ia,i by the formula 

/ — nP% a .p b .l 

Vnu iaiPb - 



y/nP iaAtl 
Thus we can write 

l<a,b<k) 

where 

Xg,b 



G ((x a ,b)l<a,b<k) ■— F ^ 

' l<a,b<k/ 

with the understanding that the right-hand side vanishes when any of the x aj i 
vanish. Similarly for A n . From construction we can easily verify that 

\V j G(x)\ =0(n°W) 

for all < j < 5. The claim (i) now follows from Theorem 8 (assuming 5 sufficiently 



small depending on Co). 

Finally, we prove Claim (ii) of Theorem 7. Assume that k = 0(1). Let e = 
e(n) = o(l) > be a slowly decaying function of n to be chosen later. Observe that 
asymptotically almost surely, one has 

k k 

EEi^ p j 2 = o(i/ £ 2 ), 

a=l 6=1 
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and thus by (i), we conclude that asymptotically almost surely, we also have 

k k 

EEim. p j 2 = o(i/ £ 2 ), 

a=l 6=1 

thus the vector u := (\Ai M i a ,pJi<a,6<fe asymptotically almost surely lies in a ball B 
of radius 0(1/ e) in . Since k = 0(1), we may thus cover this ball by 0(e~ ^) 
disjoint half-open cubes Q\, . . . , Q m of sidclcngth e; thus 

P |«e \\ Q *) =1 ~ o(1) ' 

Write £ := (£i a , Pb )i<a,b<k- Applying Claim (i) (approximating the indicator func- 
tions lQ i from above and below by smooth functions) and using the continuous 
nature of £, we see that 

m 

2|P(ffeQ i )-P(l'eQ j )l=o(l) 

i=l 

if e is sufficiently slowly decaying in n. From this, we see that we may couple u and 
£ together in such a fashion that with probability 1 — o(l), u and £ lie in the same 
cube Qi, which in particular implies that u — £ = o(l). The claim follows. 

Remark 15. By using the polynomial decay rate 0(n~ c ) in the bounds appearing 
in the proof of Claim (i), it is possible to relax the condition k = 0(1) in Claim (ii) 
to k = o(log 1//2 n), as this allows us to absorb factors of the form 0(exp(0(fc 2 ))) 
arising from the high dimension of the domain into the 0(n~ c ) factors. We 
omit the details. 

3. Proof of four moment theorem 

In this section we prove Theorem 8. We will follow closely the arguments from 
[17], [18], [19]. 

3.1. Heuristic discussion. Let us first review the general strategy from [17] for 
handling the eigenvalues. As in [17], we introduce the normalised matrices 

A n := y/nM n ,A' n = V^M' n 

(whose mean eigenvalue spacing is comparable to 1). For sake of exposition let us 
restrict attention to the case k = 1, thus we wish to show that the expectation 
EG(Xi(A n )) of the random variable G(Xi(A n )) only changes by O(n~ co ) if one 
replaces A n with another random matrix A' n with moments matching up to fourth 
order off the diagonal (and up to second order on the diagonal) . To further simplify 
the exposition, let us suppose that the coefficients £ P9 of A n (or A' n ) are real- valued 
rather than complex- valued. 

At present, A' n differs from A n in all n 2 components. But suppose we make a 
much milder change to A n , namely replacing a single entry \fnQ vq of A n with its 
counterpart y/n(' pq for some 1 < p < q < n. If p ^ q, one also needs to replace the 



RANDOM MATRICES: UNIVERSAL PROPERTIES OF EIGENVECTORS 



13 



companion entry ^/nCqp = V™(pq with V^-Cqp — V™(pq > to maintain the Hermitian 
property. This creates another random matrix A n which differs from A n in at 
most two entries. Note that A n continues to obey Condition CI, and has matching 
moments with either A n or A' n up to fourth order off the diagonal, and up to second 
order on the diagonal. 

Suppose that one could show that EG(Xi(A n )) differed from F,G(Xi(A n )) by at 
most n~ 2 ~ c " when p ^ q and by at most n _1_c ° when p = q. Then, by applying 
this swapping procedure once for each pair 1 < p < q < n and using the triangle 
inequality, one would obtain the desired bound |EG(Aj(A„)) — EG(Xi(A' n ))\ = 
0{n- Ca ). 

Now let us see why we would expect EG(Xi(A n j) to differ from EG(Xi(A n )) by 
such a small amount. For sake of concreteness let us restrict attention to the off- 
diagonal case p 7^ q, where we have four matching moments; the diagonal case 
p = q is similar but one only assumes two matching moments, which is ultimately 
responsible for the n _1_c ° error rather than n~ 2 ~ c °. 

Let us freeze (or condition on) all the entries of A n except for the pq and qp 
entries. For any complex number z, let A{z) denote the matrix which equals A n 
except at the pq, qp, entries, where it equals z and z respectively. (Actually, with 
our hypotheses, we only need to consider real-valued z.) Thus it would suffice to 
show that 

(10) EF(V^Cpq) = EF(v^C; 9 ) + 0(n- 2 - c °) 

for all (or at least most) choices of the frozen entries of A n , where F(z) := 
G(Xi(A(z))). (A standard argument allows us to restrict attention to values of 
z of size 0(^/2+0(1/00)) ) 

Suppose we could show the derivative estimates 

(11) 1J F W = 0(n- l+0 ^ +0 ^ c ^ +0 ^) 

for I = 1,2, 3, 4, 5. Then by Taylor's theorem with remainder, we would have 
F{z) = F(0) + F'(0)z + ... + 1fW(0)z 4 + 0(n- 5 + ( c °)+°( 1 /Co)+o(i)| z |5 ) 

and so in particular (using the hypothesis z = 0(n 1 / 2+0 ( 1 / c ' ))) 

F(V^Cpq) = F(0)+F'(Q)^(pq+. . .+^F( 4 )(0) v ^ 4 C p 4 9 +O(n- 5 / 2+o ( c ») + °( 1 / c <') +o ( 1 )) 

and similarly for F(^C pq ). Since n -5/2+0(co)+0(i/c )+ (i) = 0(n" 2 - c °) for C 
and n large enough and cq small enough, we thus obtain the claim (10) thanks to 
the hypothesis that the first four moments of ( pq and C,' pq match. (Note how this 
argument barely fails if only three moments are assumed to match, though it is 
possible that some refinement of this argument might still succeed by exploiting 
further cancellations in the fourth order term jiF^ (0)y/n 4 (p q -) 

We can use exactly this strategy for eigenvectors, replacing Aj by Pi lPiq (say). A 
key point here is that the derivatives of F is computed based on the basic relation 
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A n Ui — XiUi and the chain rule. Thus, in order to bound the derivatives of F 
(for the four moment theorem for eigenvalues), we needed to obtain estimates for 
the derivatives of both Aj and Uj. The same estimates can be used to bound the 
derivatives of F in the proof of the four moment theorem for eigenvectors. In order 
to make k as large as n n ^\ one needs to follow the proof closely and make certain 
adjustments. 



3.2. Formal proof. We now turn to the details. The first step is to truncate away 
the event that an eigenvalue gap is unexpectedly small. Define 



Ql[An) : ~5|A,04„)-MA„)p- 
This quantity is usually bounded from above: 
Lemma 16 (Bound on Q). If c > 0, then one has 

Q la (A n )<n Ca 
with high probability for all 1 < a < k. 



Proof. See [17, Lemma 49]. Strictly speaking, this lemma assumed Condition CO 
and was restricted to the bulk of the spectrum, but this was solely because at the 
time of writing of that paper, the gap theorem (Theorem 5) was only established 
in that setting. Inserting Theorem 5 as a replacement for [17, Theorem 19] in the 
proof of [17, Lemma 49], we obtain the claim. □ 

In view of this lemma (and its obvious counterpart for M^), we can (if k = 0{n s ) 
for a sufficiently small 8) deduce the four moment theorem from the following 
truncated version: 

Theorem 17 (Truncated Four Moment Theorem for eigenvectors). There is an 
absolute constant Co > such that for every sufficiently small constant c\ > 
there is a constant S > such that the following holds. Let M n ,M' n be two Wigner 
random matrices satisfying Cl with constant Co that match to order 4 off the 
diagonal and to order 2 on the diagonal. Let k < n s , and let G : M k x M fc x C fe — »■ K 
be a smooth function obeying the derivative bounds 

(12) \V J G(x)\ < n Cl 

for all < j < 5 and x G R fc x M fe x C k , and which is supported on the region 
{((x a )i<a<k, (y a )i<a<k, (z a b)i<a,b<k) ■ \Va\ < n Cl for all 1 < a < k} . 

Then for any 1 < ii, 12, ■ ■ ■ , ik < n and 1 < Pi, ■ ■ ■ ,Pk, Qi , ■ ■ ■ , Q_k < n, and for n 
sufficiently large depending on e, 5, c (and the constant C in Definition 2) we have 

(13) |EG(l>(A n )) - EG($«))| < n- Cl 
where for any matrix A n , $>(A n ) e K fe x C fe is the tuple 



$(Ai) := {{K a {An))l<a<k,{Q t AAn))l<a<k{nP ta {An) PcL ,q a )l<a<k) ■ 
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The reduction of Theorem 8 to Theorem 17 using Lemma 16 proceeds exactly as 
in [17, §3.3] and is omitted 2 . 

As in [17], we adopt the Lindeberg strategy of swapping each matrix entry of M n 
(and its transpose) one at a time. A key definition is that of a good configuration, 
which is a slightly modified version of the same concept from [17]. Let £i, Ci be pa- 
rameters (independent of n) to be selected later. For a given n, we fix k, i\, . . . , ik, G 
as in Theorem 17. 

Definition 18 (Good configuration). For a complex parameter z, let A(z) be a 
(deterministic) family oftixn Hcrmitian matrices of the form 

A(z) = A(0) + ze p e* q + ze q e* p 

where e p , e q are unit vectors. We say that A(z) is a good configuration if for every 
1 < a < k and every \z\ < n x / 2+ei whose real and imaginary parts are multiples of 
we have the following properties: 

• (Eigenvalue separation) For any 1 < i < n with \i — i a \ > n £l , we have 

(14) \\i(A(z)) - X ia (A(z))\ >n-^\i-i a \. 

• (Derealization) There exists an orthonormal eigenfunction basis u\ (A(z j), . . . , u n (A(z)) 
such that 

(15) sup \ui tj (A(z))\ «n- 1 / 2+£l 

l<i.j<n 

To show Theorem 17, it then suffices to establish the following two claims. The 
first claim, which we shall establish shortly, is an analogue of [17, Proposition 46], 
and involves a single (deterministic) good configuration A(z): 

Proposition 19 (Replacement given a good configuration). Suppose that C\ is 
sufficiently large, and let e\ > 0. Let A(z) be a good configuration. Then one has 

|EG(*(A(C))) - EG($(A(C'))| = 0{n- < - r+1 ^ 2+0 ^ +0 ^) 

whenever (, (' are random complex variables that match to order r for some r — 
2,3,4, and bounded almost surely by 0(n 1 / 2+£l ). 

The second claim is an analogue of [17, Proposition 48]: 

Proposition 20 (Good configurations occur very frequently). Let S\ > and 

C\ > 1, and assume that Co > 1 is sufficiently large depending on e\. Let A(0) = 
{Cij)i<i,j<n be a random Hermitian matrix with independent upper-triangular en- 
tries and |Cij| < n 1/ ' 2+10/ ' c ' for all 1 < i,j < n, with ( pq — ( qp = 0, but with Qj 
having mean zero and variance n for all other ij , and also being distributed contin- 
uously in the complex plane. Then A(0) is a good configuration with overwhelming 
probability. 



2 Note that now that k is as large as 0(n s ), some factors of 0{n°^) may be lost in the bounds, 
but this can be absorbed by the n _ci gain in the conclusion of Theorem 17. 
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Proof. This is almost identical to the arguments 3 in [17, §5]. Those arguments 
already give (14) with overwhelming probability. To obtain the derealization of 
eigenvectors, one can use [18, Proposition 1.12] (see also the proof of [17, Corollary 
63] to deal with the fact that the pq and qp entries are not random) . □ 

With these two propositions, we can establish Theorem 17 by first using Condition 
CI to truncate to the case when M n , M' n have entries of size O(n 10 ^ c °) (say), and 
perturbing them to be continuous, and then replacing the entries of M n with M' n 
one at a time just as in [17, §3.3]. Because the entries of M n and M' n match 
to order 4 off the diagonal and to order 2 on the diagonal, each of the 0(n 2 ) off- 
diagonal replacements costs an error of 0(n- 5 / 2+ °( £l ) , while each of the 0{n) 
diagonal replacements costs an error of 0(n~ 3 ^ 2+o ^ Sl ^ +o ^), and so the net error is 
acceptable if ei, 5 are sufficiently small (and if Co is large enough that Proposition 
20 applies). 

It remains to establish Proposition 19. As in [17], we will need derivative bounds 
on various spectral statistics of A(z). For each 1 < i < n, we introduce the 
resolvent-type matrices 

Lemma 21. Let 1 < i < n, and let Ao be a Hermitian matrix which has a simple 
eigenvalue at Xi(A ). Then \i{A), Pi(A), Ri(A), and Qi{A) depend smoothly on 
A in a neighborhood of A . 

Proof. Sec [17, Lemma 54]. □ 

Now we turn to more quantitative bounds on derivatives. If f(z) is a scalar, 
vector, or matrix-valued function depending smoothly (but not holomorphically) 
on a complex parameter z, we define the derivatives V m f of / to be the vector (or 
tensor)-valued quantity 

d m f 



V m /(z) := 
We have a crude bound: 



dKc{z) l dlm(zY 



Lemma 22 (Crude bound). Let A = A(z) be an n x n Hermitian matrix varying 
(real) -linearly in z (thus \7 k A = for k>2), with 

\\VA\\ op <V 

for some V > 0. Let 1 < i < n. At some fixed value of z, suppose we have the 
spectral gap condition 

(16) |A,(A(z))-A t (,4(z))|>r 



The hypotheses in [17, §5] did not have the loss of 

n lO/C in 

the bound for Qj, but such 

bounds only cause losses of 0(n°( 1 / c ' )) in the final bounds, which can be absorbed into the n ei 
factors in the definition of a good configuration if Co is sufficiently large depending on ei. 
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for all j i and some r > (in particular, Xi(A(z)) is a simple eigenvalue) . Then 
for all k> 1 we have ( at this fixed choice of z) 



k r i-k 



(17) \V k K(A(m «fc V 
and 

(18) ||V fc P 4 (A(z))|| op « fe y fc r- fc 
and 

(19) UV^^IUp < fc V k r- k -\ 
and 

(20) |V fe g 4 |« fe nT/ fe r- fe - 2 . 

Proof. Sec [17, Lemma 56]. □ 



In practice, this crude bound is insufficient, and we will need the following more 
advanced bound: 

Proposition 23 (Better bound). Let A = A(z) be an n x n matrix depending on 
a complex parameter z of the form 

A{z) = A(0) + ze p e* q + ze q e* p 

for some vectors e p ,e q . We abbreviaate Aj = Xi(A(z)), Pi — Pi(A(z)), etc. 

Let 1 < i < n. At some fixed value of z, suppose that Aj = Xi(A(z)) is a simple 
eigenvalue, and that we have a partition 



l = p 1 + J2p« 

aeJ 

where J is a finite index set, and P a are orthogonal projections to invariant spaces 
on A (i.e. to spans of eigenvectors not corresponding to A, ). Suppose that on the 
range of each P a , the eigenvalues of A — Aj have magnitude at least r a for some 
r a > 0. Suppose also that we have the incompressibility bounds 
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for all a € JU {i} and some w > and d a > 1, with di := 1. Then at this value of 
z, and for all k>\, we have the bounds 




for all k > and all a, (3 e J at this value of z. Here \\T\\p ■— (trace TT*) 1 / 2 is 
the Frobenius norm of T . 

Proof. See [17, Corollary 58], which is a special case of [17, Lemma 57]. The 
bounds (22), (23), (24) do not appear explicitly in [17, Corollary 58], but come 
directly from the equations [17, (73), (74), (75)] in [17, Lemma 57] after plugging 
in the parameters indicated in the proof of [17, Corollary 58]. □ 

We can now prove Proposition 19 and thus Theorem 8. This will be a repetition 
of the material in [17, Section 4.3]; we sketch the main points here. 

Fix k > 1, r = 2,3,4 and S\ > 0, and suppose that C\ is sufficiently large. We 
assume A(0),e p , e q , ii, . . . , ik, G, F, C, (' are as in the proposition. 

We may of course assume that F(zq) ^ for at least one z with |z | < n 1/<2+£l , 
since the claim is vacuous otherwise. 

Using Taylor expansion and the chain rule exactly as in [17, Section 4.3], it suffices 
to show that 

Lemma 24. Suppose that F(z ) ^ for at least one z with \z \ < n 1 / 2+£l . Then 
for all z with \zo\ < n 1 / 2+£l , and all 1 < j < k, we have 

(26) \V m \ l3 {A{z))\ «n°( El » n - m 
and 

(27) n|V m P^, gj (A(z))| « n°^n- m 
and 

(28) \W m Q l3 (A(z))\^n° ( ^n- m 
for all z with \z\ < n 1/2+£l and all < m < 10. 
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The bounds (26), (28) were already proven in [17, Lemma 59]; the arguments in 
the proof of that lemma also show that 

Q l] (A(z)) 
uniformly for all z with \z\ < n 1 / 2+ei . 

Now we prove (27). Arguing inductively as in the proof of [17, Lemma 59], it 
suffices to establish the claim for z in the ball B(zo, n _1_2ei ). Let us first establish 
this for z whose real and imaginary parts are a multiple of n~ Cl . At this value of 
z, we can apply Proposition 23 exactly as in [17, Section 4.3] to obtain the bounds 

(29) ||^(V m PgP 4 J F «n- m +°( £l >( J2 7") m 

0<a<logn a 

(30) 

\\p!f(v m p^ 3 \\ F = wp^p^Af « ^V™+°(-)( ]r ^r- 1 

0<a<log n 

(31) ||P, (Q) (V m P i .)^ ) ||F« — n- m+ °^\ Y —) m - 2 

J 3 %i r a ra r a 

0<a<logn 

for all < m < 10, 1 < j < k, and < a, j3 < log n, where r a is the minimal value 
of \X i — \i j | for \i — ij\ > 2 a , and P^ is the spectral projection to those eigenvalues 
with 2" < < 2 a+1 . 

In [17, Section 4.3] it is shown that 

£ - a <<n ° (ei); 

0<a<log n 

in particular 
We conclude that 

(32) ll^ a) (V m i > i ,)i^ ,) ||F < 2- Q / 2 2- /3 / 2 n- m+ °( £l ) 

for all —1 < a, )3 < logn, where we adopt the convention that p\. ^ := Pj r Now, 
we expand 

(V m P 3 ) P3 , 9j - e ; 3 (V m P 3 )e 9j 

= E (^^(^(V^Pf )(Pf e ft ). 

— l<Q,/3<logn 

Applying the triangle and Cauchy-Schwarz inequalities we conclude that 

|(V™P,) P ,, 9 J < ]T \\e* Pj P£ \\\\P^ ) (V m P ij )P^ \\ F \\HP e qj \\. 

— l<a,/3<log n 

From the hypothesis (15) and Pythagoras' theorem we have 

\\e; 3 P!f\\«n^2^n-^ 

and similarly 

ll^ ) e 93 ||«n ei 2^ 2 n- 1 /2 
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so from (32) we obtain (27) as required, at least when the real and imaginary parts 
of z are multiples of n~ Cl . The general case can then be handled (for C\ large 
enough) by an appeal to Lemma 22 by arguing exactly as in [17, Section 4.3]. 



We now prove Theorem 13. The main tool is the following general central limit 
theorem: 

Proposition 25 (Central limit theorem). Suppose one has a random unit vector 
u = (ui, . . . , u n ) <G W 1 such that 

(i) (Exchangeability) The distribution of u is symmetric with respect to the 
permutation group S n . 

(ii) For any distinct i\, . . . , i k with k fixed, y/nu il7 . . . , y/nu ik converges jointly 
in distribution to k iid copies of N(0, 1) as n — > oo. 

(hi) (Symmetry) The distribution ofu is symmetric with respect to the reflection 
group { — 1, +1}™. 

Then for any unit vector a € 1" (depending on n, of course), y/na ■ u converges in 
distribution to N(0, 1)r. 

Indeed, part (b) of Theorem 13 follows immediately from this proposition and 
Corollary 4, noting from the symmetries of M n that the eigenvector Ui(M n ) is both 
symmetric and exchangeable. Part (a) also follows after first reducing to the case 
a ■ ei =0 (noting from Corollary 4 that Ui{M n ) ■ e\ converges in distribution to 
\N(0, 1)r|, so that (ui(M n ) ■ ei)(a ■ ei) converges in distribution to if a- e\ = o(l)). 

We now prove Proposition 25. Let A = A n be a quantity growing slowly to infinity 
that we will choose later. We truncate 

where Ui t < := Ujl| Mi |<A/yn anc ^ u i,> : = u ^\ui\> A / '^n- We then split u = u< + w> 
correspondingly. It will suffice to show that, for a suitable choice of A, 

(1) a ■ u< converges in distribution to iV(0, 1)r, and 

(2) a ■ it> converges in probability to zero. 

We first consider the second claim. Here we use the second moment method. It 
suffices to show that 

nE\a ■ U> \ 2 = o(l). 
Set a = (oi, \a n ). The left-hand side can be expanded as 



%=\ j=i 

Using the symmetry hypothesis (iv), we see that if i ^ j, then Ui^uj^ has a 
symmetric distribution and thus has mean zero. Thus only the diagonal terms 



4. Proof of Theorem 13 



n 



ii 
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contribute. Using hypothesis (i) and the unit normalization of a, the second moment 
becomes 

riE,u\ t> . 

On the other hand, from (i) we have 

nEw 2 = 1 

while from (ii) we have 

nEu?l| Ul |<* - EG 2 l lGl < K + o(l) 
for any fixed K, where G = N(0, 1), and thus (since A = A n goes to infinity) 
liminf nEu\l\ Ul \ <A > sup EG 2 l| G i<if = 1. 

We conclude that 

nEuj < = 1 - o(l) 

and 

nEuf > = o(l) 

and Claim 2 follows. 

Now we turn to Claim 1. Here we use the moment method. By Carleman's 
theorem (see e.g. [1]), it suffices to show that for each fixed positive integer k, 

n fe / 2 E(a • u<) k =EG k + o(l). 

The left-hand side expands as 

n k/2 ^ a h ■■■a ik Eu ilt < ...u ikt <. 

By the symmetry hypothesis (iii), the expectation vanishes unless each index i 
appears an even number of times. Using hypothesis (ii) (and (iii)), we see that 

n k/2 Eu iu < ...u ik> < = EG 4l ...G ik + o(l) 

where G\,...,G n are iid copies of ^(0,1), uniformly in if A n grows 

sufficiently slowly to infinity. Observe that 

«ii • • ■ aifcEGij . . . Gi k = E I a^Gi ) = EG* 

l<ii,....i k <n \i= 1 / 

since aiGi = G, and that (as before) the summands vanish unless each index 

i appears an even number of times. Thus it suffices to show that 



J2a n ...a ik = 0(1) 



where the sum * is over all fc-tuples 1 < i\, . . . , ik < n in which each index i appears 
an even number of times, and the implied constants in the OQ notation are allowed 
to depend on k. 

Suppose there are I distinct indices appearing, then the contribution of this case 
is at most 

o(±...±al...al 

\ii=i ji=i 
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(since we have a™ < aj r whenever m is a positive even number) . But as a is a unit 
vector, this sums to 0(1) as required. This concludes the proof of Proposition 25 
and hence Theorem 13. 

Remark 26. An extension of the above argument shows that, under the hypotheses 
of Proposition 25, if a 1 , . . . , a 1 are a bounded number of orthonormal vectors in R", 
then y/na 1 -u, . . . , y/na l -u converges jointly in distribution to I iid copies of N(0, 1)r. 



5. Proof of Theorem 9 



We now prove Theorem 9. Let M n , M' n ,C, z, E,rj,p,q be as in that theorem. 
Let ci > be a small constant to be chosen later, and let C2 > be an even 
smaller constant (depending on ci) to be chosen later. Write A n := y/nM n and 
A' n := yfiiM' n . 

Let us call an expression depending on M„ (or A n ) stable if it only changes by 
0(n~ c ) for some c > if M n is replaced by M' n . Our task is thus to show that 

EG [Cj^M n -zl) 1 ) is stable. 



VI 



We first subtract off the "global" portion of the resolvent (^=M n — zlj . Set 

z := E + m _1+C2 / 2 . Applying the local semicircle law from [9, Theorem 2.1], one 
has 4 

IV 1 

— M n - zqI = m sc (z )5pq + 0(n c ) 

J pq 

with overwhelming probability for some c > 0, where S pq is the Kronecker delta 
function and m sc (zo) is semicircular Stieltjes transform 



m sc {z ) := — 



2 

This type of result already gives the claim when r\ > n~ 1+C2 / 2 , so we may assume 
that r] < n~ 1+C2 / 2 . After shifting G by m sc {z )8 pq , and using the regularity bounds 
on G, it thus suffices to show that the expression 



1 „ A" 1 ( \ 



EG I [~^M n - zlj - \^-j=M n - z I 



is stable. 
From (5) we have 



(A=M n - zl) - (^=M n - z Q l) = J2 F(X t (A n ))nP^ q (A n ) 

\V n /no \V n J pq 



pq \ V / pq i=1 



^Strictly speaking, the statement of [9, Theorem 2.1] only controls the diagonal component 
p = q of the resolvent. However, an inspection of the proof of that theorem (see in particular 
[9, (3.13)] and [9, Proposition 3.3]) reveals that the off-diagonal components p ^ q were also 
controlled by the argument. 
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where 

F(x) := — — . 

x — nz x — nzo 

Note that F decays quadratically in x rather than linearly, due to the subtraction 
of the comparison term — - — ; this will allow us to easily neglect the contribution 
of the spectrum that is far from E in the rest of the argument. 

We now invoke the eigenvalue rigidity result from [10, Theorem 2.2]. Among other 
things, this theorem gives an interval [z_,z + ] C [1,n] of length i + — z_ = 0(n C2 ) 
(depending only on n, E, and C2) which contains most of the eigenvalues close to 
E, in the sense that we have 

inf \Xi(A n )-E\ >n C2 -°« 
ie[i,n]\[i-,i+] 

with overwhelming probability. In fact, we have the stronger assertion that 
\\i(A n ) -E\> n-°«(n C2 + dist(i, 

for all i € [1, n]\[i_,i+] with overwhelming probability. In particular, this implies 
that 

n c 2 /2+o(l) 

and thus 

(33) £ \F(^(A n ))\<n- c */ 2+ °^ 

ie[l,n]\[i_,i+] 

with overwhelming probability. 

Also, by the derealization of eigenvalues (established in the bulk in [7, Theorem 
4.8] (see also the earlier result [6, Theorem 1.2] handling the smooth case), and up 
to the edge in [18, Proposition 1.12]), we have 

K P (A n )\ = Otn-VW)) 
with overwhelming probability for all 1 < i,p < n, and hence 

nP l , p , q (A n ) = 0(n°«) 

with overwhelming probability for all 1 < i,p,q,< n. As such, we see that with 
overwhelming probability, we have 

1 „ .V 1 ( 1 



—=M n -zl) - —=M n - z I 

J pq VV" y pq 

= F(MA n ))nP hp ^A n ) + 0(n-^ 2 +°W) 

with overwhelming probability. Using the regularity bounds on G, it thus suffices 
to show that the quantity 

EG( ]T F(\i(A n ))nP it p iq (A n )) 

i£[i- 

is stable. 
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We let x '■ C — >• [0, 1] be a cutoff function supported on B(0, 1) that equals 1 on 
B(0, 1/2). Define the truncated version 

F(x) :=F(x)(l- X (n Cl (x-z))) 

of F. From the hypothesis (6), we see that for each i e one has 

F(\i(A n )) = F(\i(A n )) 

with probability at least 1 — O(n~ 10c2 ) (say), if C2 is sufficiently small depending 
on ci and on the implied constant in the high probability event (6). In particular, 
by the union bound, we have 

G{ Y F(*Mn))nPi, p , q (A n )) = G( Y FMAn))nP t , p , q (A n )) 

»£[»_,*+] i£[i_,i+] 

with high probability. 

Similarly for A' n using (7). It thus suffices to show that the quantity 
EG( Y H*Mn))nPi, p , q (A n )) 

»£[»_ ,»+] 

is stable. But this follows directly from Theorem 8 (if ci, C2 are small enough). The 
proof of Theorem 9 is now complete. 

Remark 27. The exponential decay hypothesis (Condition CO) in Theorem 9 is 
needed only to be able to use the eigenvalue rigidity result from [10] and the local 
semicircle law from [9]. It is quite likely that in both cases, one can relax Condition 
CO to Condition CI (conceding some factors of n ^ 1 / ^ in the process), which 
would then allow one to achieve a similar relaxation in Theorem 9. 



References 

[1] Z. D. Bai and J. Silverstein, Spectral analysis of large dimensional random matrices, Mathe- 
matics Monograph Series 2, Science Press, Beijing 2006. 

[2] E. Borel, Introduction geomctriquc a quclqucs theories physiques, Gauthicr-Villars, Paris, 
1906. 

[3] B. Collins, Integrates Matricielles et Probabilities Non-commutatives. Th'ese de Doctorat of 
Universite Paris 6, 2003. 

[4] P. Diaconis, M. Eaton, L. Lauritzen, Finite deFinctti theorems in linear models and multi- 
variate analysis. Scand. J. Statist. 19 (1992), 289-315. 

[5] P. Diaconis, D. Frccdman, A dozen de Finetti-style results in search of a theory. Ann. Inst. 
Henri Poincarc, 23 (1987), 397-423. 

[6] L. Erdos, B. Schlein and H.-T. Yau, Local semicircle law and complete derealization for 
Wigner random matrices, Comm. Math. Phys. 287 (2009), no. 2, 641655. 

[7] L. Erdos, B. Schlein and H.-T. Yau, Wegner estimate and level repulsion for Wigner random 
matrices. Int. Math. Res. Notices 2010 (2010), 436-479. 

[8] L. Erdos, B. Schlein and H.-T. Yau, Universality of Random Matrices and Local Relaxation 
Flow, arXiv.0907.5605 

[9] L. Erdos, H-T Yau, and J. Yin, Bulk universality for generalized Wigner matrices, 
arXiv.1001.3453 

[10] L. Erdos, H-T Yau, and J. Yin, Rigidity of Eigenvalues of Generalized Wigner Matrices, 
arXiv: 1007. 4652 

[11] L. Gallardo, Au sujet du contenu probabiliste d'un lemma d'Hcnri Poincare, Annals de 
I'universite de clemont, Vol. 69, (1983), 192-197. 



RANDOM MATRICES: UNIVERSAL PROPERTIES OF EIGENVECTORS 



25 



[12] T. Jiang (2006). How Many Entries of A Typical Orthogonal Matrix Can Be Approximated 

By Independent Normals?, Ann. Probab. 34, 1497—1529. 
[13] T. Jiang, The Entries of Haar-invariant Matrices from the Classical Compact Groups, Journal 

of Theoretical Probability, 23, 1227-1243. 
[14] A. Knowles, J. Yin, Eigenvector Distribution of Wigner Matrices, arXiv: 1 102.0057 
[15] A. Maltsev, B. Schlcin, Average Density of States for Hermitian Wigner Matrices, 

arXiv:1011.5594 

[16] A. J. Stam, Limit theorems for uniform distributions on high dimensional Euclidean spaces. 

J. Appl. Prob., 19 (1982), 221-228. 
[17] T. Tao and V. Vu, Random matrices: Universality of local eigenvalue statistics, 

arXiv.0906.0510. 

[18] T. Tao and V. Vu, Random matrices: Universality of local eigenvalue statistics up to the 
edge, Communications in Mathematical Physics, 298 (2010), 549—572. 

[19] T. Tao and V. Vu, Random covariancc matrices: Universality of local statistics of eigenvalues, 
arXiv.0912.0966. 

[20] T. Tao and V. Vu, Random matrices: Localization of the eigenvalues and the necessity of 
four moments, arXiv: 1005.2901. 

[21] T. Tao and V. Vu, The Wigner-Dyson-Mchta bulk universality conjecture for Wigner matri- 
ces, arXiv: 1101. 5707. 

[22] R. Vershynin, Invertibility of symmetric random matrices, arXiv:1102.0300. 

[23] M. Yor, Illegalities dc martingales continus arrctes a un temps quelconques I. Springer Lecture 
Notes in Math., No. 1118, 1985. 



Department of Mathematics, UCLA, Los Angeles CA 90095-1555 
E-mail address: taoamath.ucla.edu 



Department of Mathematics, Rutgers, New Jersey, NJ 08854 



E-mail address: vanvuamath.rutgers.edu 



